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Abstract 


In this work we develop the concept of generalized probabilities, which can take real negative values, with basis on 


multiset concepts and the Jaccard similarity index. In particular, multisets are generalized to real values, allowing a 


respective real-valued Jaccard index to be obtained which, as with the classic Jaccard index for sets, can be understood 


as representing a respective generalized probability. 


“Does negative probability have a negative probability?’ 





LdaFC 


1 Introduction 


Probability has almost invariably been understood as tak- 
ing non-negative values, which gives rise to the concept 
of respective density probability functions. Yet, there are 
some circumstances, as in quantum mechanics, where neg- 
ative probabilities arise. 

In the present work, we develop a possible definition 
of real-valued probabilities taking values in the interval 
[—1, 1]. First, we review the concepts of non-negative and 
real-valued multisets and their main operations [1, 2, 3, 4]. 
Multisets are sets that allow repetition of their elements 
(e.g. [5, 6, 7, 8, 9, 10). 

Subsequently, we show that the classic Jaccard similar- 
ity index can be understood as a well-defined probability 
definition. Generalized probabilities are then obtained 
from the real-valued version of the Jaccard similarity in- 
dex. 


2 Non-Negative Real- Valued 


Multisets 


Let S4 be the support of a multiset A, therefore contain- 
ing all its possible elements a;, i = 1,2,...,Na. 

A multiset A is a set of 2-tuples [a;,m(a;)], where 
m(a;) > 0 is the multiplicity of the elements a; in the 
multiset A. 


Let A and B be two generic multisets. Their combined 
support corresponds to S = S4 U Sg. For simplicity’s 
sake, in this case we write: 


S = {ai} (1) 


where x;, i = 1,2,...,Naup are all the possible ele- 

ments to be found in A or B. 
A multiset A is contained or identical to another mul- 
tiset B, i.e. A C B, if and only if: 
malzi) < Mmpg(zi), Va, eS (2) 


The union of two multisets A and B is a new multiset 
C given as: 


C = { [z:;, max {m4 (2:1), mg (2:)}] I (3) 


The sum of two multisets A and B is a new multiset C 
given as: 


C = { [xi,ma(ai) + Mmg (z:)] h (4) 


The intersection of two multisets A and B is a new 
multiset C given as: 


C = { [z:, min {m4 (2:), mg (x:)}] b (5) 


The difference between two multisets A and B is a new 
multiset C given as: 


C = { [z:;; ma (zi) — mg(z:) |} (6) 


The universe multiset respective to multisets A and B 
can be defined as: 


Q4p=AUB (7) 


where U means the multiset union. 

The universe multiset is always defined respectively to 
a group of multisets of interest. 

The empty multiset respective to a multiset A is given 
as: 


D4 = { [24,0] } (8) 
The complement of a multiset A is given as: 
AF = 64 — A = { [z;, —ma(zi) |} (9) 


All properties analogous to those of conventional sets 
are inherited by multisets under the above premises. For 
instance, we have that: 


AUQa = 4 [xi, max {ma (zi), ma (xr) =A (10) 
AN ®y, = { [z;, min {m4 (zx:;), 0} } = 4 (11) 
Qa Ua = 04 (12) 
Qa Na = 04 (13) 


3 Real-Valued Multiset 


A real-valued multiset A is a set of 2-tuples [a;, m(a;)], 
where m(a;) € R is the multiplicity of the elements a; in 
the multiset A. 

All real-valued multiset operations are analogous to the 
non-negative real multisets, with additional operations as 
follows. 

The common union of two multisets A and B is a new 
multiset C given as: 


C = AU B = {s4,,B; [t;,max{s4,A;i,58,B;}|} (14) 


where A; = ma(a;) and Bi = mpg(zi). 
The common intersection of two multisets A and B is 
a new multiset C given as: 


C = AN B = {s4,,B; [£i min {s4; Ai, sg; Bi} } (15) 


So, in the case of a single multiset A, we have that: 


























ANQA=A (16) 
AUQ,=24=A (17) 
ANa =ð4 (18) 
AUA =A (19) 
AnN-A=ANA? =-—A (20) 
AU-A=AUA?=-A (21) 


The absolute union of two multisets A and B is a new 
multiset C given as: 


C = AUB = { [x;, max {s 4, Ai, sB; Bi}] |} (22) 


where A; = m4(z;) and B; = mp(2;). 
The absolute intersection of two multisets A and B is 
a new multiset C given as: 


C = ANB = 4 [x;, min {s4, Ai, sB, Bi} } (23) 


4 The Jaccard Index as Probabil- 
ity 

The classic Jaccard index for expressing the similarity be- 

tween two sets A and B is defined as: 


_ |AnBI 


Ter |AU B] 





(24) 


with 0 < J(A, B) < 1. 

Interestingly, nothing is commonly said or assumed 
about the respective universe set Q. However, it can be 
inferred that necessarily we need to enforce that: 


ACQ (25) 
BO (26) 


Therefore, the smallest possible universe is given as [1]: 


Q=AUB (27) 


However, in case A C B or B C A, we immediately 
have that Q = B. No, the respective Jaccard index is 
given as: 


A 
P(A) = JAD) = 5 (28) 
From which we conclude that the Jaccard index actu- 
ally corresponds precisely to the notion of probability. 


5 Generalized Probabilities 


Now, we have that the Jaccard index has been recently 
generalized, through an extension of the concept of multi- 
sets to consider real and possibly negative multiplicities, 
as the real-valued Jaccard index [1, 2, 3], being defined as: 





ANB 3g 8a,,B;, min {s4,Ai, $B, Bi} 














A, B) = —— = 
JR(A, B) AŬB dog max {84, Ai, SB, Bi} 
(29) 
with —1 < Jr(A, B) < 1 and where: 
A; = ma(ai) (30) 
B; = mp(zi) (31) 
sa, = sign(A;) (32) 
$B, = sign(B;) (33) 
5A,,Bi = $A; $B; (34) 
a, €S=S45,i=1,2,...,Naup (35) 


where N is the number of distinct elements in A and 
B. 


By direct analogy with Equation 28, we now make 
A C B = QA pg and so obtain a generalized defini- 
tion of probability that can take values in the interval 
—1 < P(A) < 1 as: 





P(A) = JrR(A, 94B) = das Ai 


E dog max {84, Ai, $B, Bi} (35) 


6 Real Funtions 


All the above results and properties generalized to real 
functions. 

For instance, given two functions f(t) and g(t) with 
combined support S = Sy, we have that their union 
corresponds to: 


h(x) = max { f(x), 9(@) (37) 


All the other operations are analogously obtained. 
The real-valued Jaccard index for real functions then 
becomes: 





fng = Ís ss g min {sf f(x), sgg(x)dz} 
fõg Js max {ss f (2), sgg(x)dx} 











Jr(f, g) = 





And now we are in position of defining the probability of 
a function f(x) with respect to a multiset universe wy,(z) 
as: 


P(f(x)) = IR(f, wy) = 
fn w fa f(x)dx 
fuw yg Jo max {sf f(x), 8..,,0f9(x)} 





(39) 





7 Concluding Remarks 


The concept of probability has been generalized to nega- 
tive values. After reviewing multisets generalized to real- 
values [1, 2], and showing that the classic Jaccard index 
can be though as a probability, we combine these two rea- 
soning lines in order to obtain a possible definition of real- 
valued probabilities constrained in the interval [—1, 1]. 
These concepts can be extended to real functions, allowing 
the definition of the probability of a function with respect 
to a given respective real-valued universe multiset. 
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